Anti-parallel Coiled Coils Structure Prediction by Support Vector Machine Classification

نویسندگان

  • Zhong Huang
  • Yun Li
  • Xiaohua Hu
چکیده

Coiled coils is an important 3-D protein structure with two or more stranded alpha-helical motif wounded around to form a “knobs-into-holes” structure. In this paper we propose an SVM classification approach to predict the antiparallel coiled coils structure based on the primary amino acid sequence. The training dataset for the machine learning are collected from SOCKET database which is a SOCKET algorithm predicted coiled coils database. Total 41 sequences of at least two heptad repeats of the anti-parallel coiled coils motif are extracted from 12 proteins as the positive datasets. Total 37 of non coiled coils sequences and parallel coiled coils motif are extracted from 5 proteins as negative datasets. The normalized positional weight matrix on each heptad register a, b, c, d, e, f and g is from SOCKET database and is used to generate the positional weight on each entry. We performed SVM classification using the cross-validated datasets as training and testing groups. Our result shows 73% accuracy on the prediction of anti-parallel coiled coils based on the cross-validated data. The result suggests a useful approach of using SVM to classify the anti-parallel coiled coils based on the primary amino acid sequence.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Porosity Prediction of One of Iran South Oil Field Carbonate Reservoirs Using Support Vector Regression

Porosity is considered as an important petrophysical parameter in characterizing reservoirs, calculating in-situ oil reserves, and production evaluation. Nowadays, using intelligent techniques has become a popular method for porosity estimation. Support vector machine (SVM) a new intelligent method with a great generalization potential of modeling non-linear relationships has been introduced fo...

متن کامل

PREDICTION OF SLOPE STABILITY STATE FOR CIRCULAR FAILURE: A HYBRID SUPPORT VECTOR MACHINE WITH HARMONY SEARCH ALGORITHM

The slope stability analysis is routinely performed by engineers to estimate the stability of river training works, road embankments, embankment dams, excavations and retaining walls. This paper presents a new approach to build a model for the prediction of slope stability state. The support vector machine (SVM) is a new machine learning method based on statistical learning theory, which can so...

متن کامل

Robustified distance based fuzzy membership function for support vector machine classification

Fuzzification of support vector machine has been utilized to deal with outlier and noise problem. This importance is achieved, by the means of fuzzy membership function, which is generally built based on the distance of the points to the class centroid. The focus of this research is twofold. Firstly, by taking the advantage of robust statistics in the fuzzy SVM, more emphasis on reducing the im...

متن کامل

Socket: a program for identifying and analysing coiled-coil motifs within protein structures.

The coiled coil is arguably the simplest protein-structure motif and probably the most ubiquitous facilitator of protein-protein interactions. Coiled coils comprise two or more alpha-helices that wind around each other to form "supercoils". The hallmark of most coiled coils is a regular sequence pattern known as the heptad repeat. Despite this apparent simplicity and relatedness at the sequence...

متن کامل

Feature Selection Using Multi Objective Genetic Algorithm with Support Vector Machine

Different approaches have been proposed for feature selection to obtain suitable features subset among all features. These methods search feature space for feature subsets which satisfies some criteria or optimizes several objective functions. The objective functions are divided into two main groups: filter and wrapper methods.  In filter methods, features subsets are selected due to some measu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006